Full Connectivity: Corners, edges and faces 



Justin Coon, 1 Carl P. Dettmann and Orestis Georgiou 3 * 

1 Toshiba Telecommunications Research Laboratory 
32 Queen Square, Bristol BS1 4ND, United Kingdom 

2 University of Bristol School of Mathematics 
University Walk, Bristol BS8 1TW, United Kingdom 
3 Max-Planck-Institute for the Physics of Complex Systems 
Nothnitzer Str. 38, 01187 Dresden, Germany 

*To whom correspondence should be addressed. 
Abstract 

We develop a cluster expansion for the probability of full connectivity of high den- 
sity random networks in confined geometries. In contrast to percolation phenomena at 
lower densities, boundary effects, which have previously been largely neglected, are not 
only relevant but dominant. We derive general analytical formulas that show a persis- 
tence of universality in a different form to percolation theory, and provide numerical 
confirmation. We also demonstrate the simplicity of our approach in three simple but 
instructive examples and discuss the practical benefits of its application to different 
models. 

1 Introduction 

Percolation is a phase-transition phenomenon in large random networks whereby at a critical 
value p c of a parameter such as density that controls the connection probabilities, the 
largest connected component (cluster) of the system experiences a sudden change from 
being independent of system size (microscopic) to being proportional to it (macroscopic). 
For example in infinite systems with densities p < p c (sub-critical), all clusters are finite 



almost surely while for p > p c (super-critical), any given node is in an infinite cluster with 
positive probability. As with other statistical phase transitions, in the thermodynamic limit 
of large system size, the percolation density p c is very much independent of the size and 
shape of the system and in general of the microscopic details of the model leading to the 
phenomenon of universality. Percolation theory does not, however, address the question 
of when finite systems are fully connected, that is, the probability of the entire system 
comprising a single cluster. Here we develop a theory of the latter, showing that boundary 
effects are crucial for understanding this problem, and demonstrate a general formalism for 
calculating their contributions. 

The well developed theory of percolation [lj finds its historic roots in the early 1930s 
in studies by physicists and chemists concerned with nucleation and condensation of gases 
into liquids as well as clustering of interacting particles and colloids. The theory of per- 
colation was then initiated in 1957 in order to study random physical processes such as 
fluid flow through disordered porous media [2J. A more general mathematical approach 
was formulated soon thereafter in 1959 in the form of random graphs [3J using the famous 
Kolmogorov zero-one law in probability theory to state that given a infinite graph with 
connections chosen randomly and independently, the existence probability of an infinite 
cluster is either zero or one. A wealth of different models and approaches as well as out- 
standing open problems can be found in [I]. The exact statistical mechanical formalism of 
continuum percolation on a lattice-free basis was eventually formulated in the late 1970s [5] 
and has been extensively applied in a wide variety of settings as the theory provides useful 
information on cluster statistics (for a review article see [6]). 

The triumph of percolation theory and statistical physics [7] in describing particle clus- 
tering in liquids [8] , gases [10] and colloids [11] , was also successfully adopted in studies 
of electrical conductance in disordered media. These have included investigations of trans- 
port in carbon nano-tube networks |12j and metallic insulation in composite materials |13j . 
The concept of quantum percolation and its connections with the quantum Hall effect [2] 
as well as phenomena such as spontaneous magnetization |15j are but a few of the many 
recent theoretical advancements. Moreover, percolation theory has gone beyond physical 
sciences to topics such as network modeling. These include interpreting (self)-organization 
processes of complex networks, such as those displayed by functionally related proteins 
[T6] ; the spread of forest fires [171 02] , epidemics [HI [201 EI] > computer [22] and phone [23] 
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Figure 1: (Color online) A single high density realization is shown with iV = 450,500 and 600 nodes 
(represented as balls) randomly placed inside a cubed domain of side L — 10. Lighter colors indicate a 
higher probability of being in the largest connected component; note that isolated nodes (darker colors) are 
concentrated at the edges and corners at higher densities. 

viruses; industrial and economic sectors [24J; and social groups of people [23| , [27]. 

The current work is primarily motivated by the recent adaptation of percolation theory 
in wireless communications and multihop relay networks. These consist of a number of 
communication devices (nodes) which can pass messages to each other without the need of 
a central router. Multihop relay networks can therefore achieve good and reliable coverage 
and connectivity over a large area even when some nodes are moved or deactivated. Most 
models of such systems are closely related or derived from studies by probabilists under the 
topic of random geometric graphs or networks. Percolation theory, both on a lattice and 
continuum has thus been previously applied to random networks by engineers identify and 
analyze power management techniques (281 [29] , network resilience [30j , efficient relay place- 
ment [31], coverage and connectivity in wireless sensor networks [32J (a direct adaptation 
of [10j), and the information theoretic capacity of networks |33j . 

A major weakness of the theory which is immediately evident in the current context is 
that many networks, although large, are not of infinite size and are often confined within 
a finite region. Therefore infinite (or 'giant') clusters and critical percolation densities no 
longer make sense in the usual thermodynamic limit. Instead, the more suited notion of full 
connectivity (originally posed in [3]) is addressed, typically in some asymptotic regime such 
as in the extended network and the dense network models (see |34j and references therein) . 
Both of these models scale the system size exponentially with density and hence manage to 
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Figure 2: (Color online) Five equally spaced cross sections of the average connectivity (averaged over 
100 different realizations) inside a cubed domain of side L = 10 with node density p = 0.5. The outer plots 
correspond to opposite faces of the cube. The same color scheme as in Figure [T] is used with light and dark 
colors indicating good and bad connectivity respectively. It is clear that corners, edges and faces of the cube 
constitute critical regions where full connectivity is lost with maximum probability. 



ignore any boundary effects. However in the natural limit that we consider, where system 
size increases with density more slowly than exponential or not at all, boundary effects 
dominate. 

The above observation is emphasized in Figure [TJ which shows a single realization 
of (from left to right), 450, 500 and 600 nodes (represented as overlapping balls) placed 
randomly inside a cubed domain, with each ball colored according to the total probability 
of it connecting to any of the other N — 1 nodes surrounding it. We assume here an 



exponentially decaying (with distance) connection probability function (see equation (19)). 
Notice that isolated, and hence hard to connect, nodes (black) are in this case near corners, 
edges or faces of the cubed domain. Averaging over 100 such realizations and producing 
density plots of cross-sections of the cubed domain (see Figure [2]) clearly demonstrates that 
these are critical regions where full connectivity is lost with maximum probability. That 
is, to leading order in the high density limit, the probability of obtaining a fully connected 
network is controlled by the probability of a single isolated node. Lower density corrections 
to this are due to two, three and larger isolated clusters. This observation has formed 
the basis of our recent work [35J, where we presented a new cluster-expansion approach 
to analyzing connectivity in confined geometries. This approach allowed for a consistent 
analysis and inclusion of the local geometric boundary effects leading to a general closed- 
form analytical formula for calculating the probability of full connectivity Pf c . 

The current paper complements our previous results by providing additional details of 
our approach in a more general setting, extends them with further calculations of second 
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order correction terms, and also demonstrates the inaccuracy of conventional bulk models 
through three simple, yet instructive example network confinements (circle, square and 
sector) which are numerically confirmed for both probabilistic and sharp (unit-disk type) 
pair connectedness functions. The structure of the paper is as follows: In section 2, we derive 
a high density cluster approach which we then expand up to second order in subsections 
2.1, 2.2 and 2.3 assuming a homogeneous system. We then compute the overall probability 
of a fully connected network for a given exponentially decaying connectivity function in 
subsection 2.4. In section 3, we lift the homogeneity assumption and consider boundary 
effects explicitly illustrating their importance for any connectivity probability function that 
decays suitably fast. We then work through three examples (subsections 3.2, 3.3 and 3.4) for 
the previously given connectivity function and provide plots of comparisons with numerical 
simulations. These examples lead to the construction of a general formula for the overall 
probability of a fully connected network in subsection 3.5. Finally in section 4 we summarize 
and conclude with a short discussion. 



2 A cluster expansion approach 

In this section we develop a high density connectivity model based on a cluster expansion 
approach which is able to produce closed-form analytical results in some generality. We 
assume a homogeneous network and perform the expansion up to second order and comment 
on the third and higher order corrections. First however we need to set the stage and define 
some useful quantities. 

Consider N randomly distributed nodes with locations r« G V, a convex subset of R d , 
where i = 1,2, ...N, according to a uniform density p = N/V, where V = \V\ and | • | 
denotes the size of the set using the Lebesgue measure of the appropriate dimension or 
the cardinality of a finite set. We assume that nodes i and j are directly connected with 
probability H(rij), often written just Hij where r,- L j = | r j — r ^ | is the distance between i and 
j- 

We may also consider a more general setting, as follows: M. d could be replaced by a 
Riemannian manifold Ai d with volume element derived from its metric tensor. The direct 
connection distance function rjj may or may not be given by geodesic distance derived from 
the metric tensor. The convexity assumption is then replaced by the statement that this 



5 



distance function is a metric in the mathematical sense, i.e. symmetric, non-negative, zero 
only when the nodes coincide, and satisfying the triangle inequality^ One example of a 
non-Euclidean manifold and discussion of its curvature effects is discussed in section 2.4. 
Otherwise we will use only Euclidean space 
We define an average over all configurations 

{A} = —*f A(ri,r2,...,r/v)dridr 2 ...drjv- (1) 

V Jv N 

We also need notation to define the relevant graphs. Let S = {1, 2, 3, ... , N}. A graph g = 
(A,L) consists of a set A C S of nodes, together with a collection L C G A : i < j} 

of direct links, that is unordered distinct pairs of nodes. As a slight abuse of notation we 
write € g to denote that is an element of the set of links L associated with the 
graph g. We write G A for the set of graphs with nodes in A, and G A for the set with nodes 
in A and largest connected component (cluster) of size j with 1 < j < \A\. 

The probability that two nodes are connected or not leads to the trivial identity: 

1 = Hij + (1 - Hij). (2) 

Multiplying over all links with nodes in a set A expresses the probability of all possible 
combinations. This can be written as 

i= n [Hij + { i- Htj)]= £ n 9 , (3) 

ijeA;i<j g eG A 

where 

n «= n ff « n (!-^)- (4) 

The sum in equation ([3]) contains 2' j4 '('" 4 '~ 1 ^ 2 separate terms. Setting A = S, this can be 
expressed as collections of terms determined by their largest cluster: 

E • 

9£Gf (5) 




1 If the convexity assumption is relaxed it is a semi- metric function, i.e. the triangle inequality may not 
be satisfied. For example, r%j is effectively infinite if there is an obstacle blocking the path of a wireless 
signal, but adding a node k that avoids the obstacle may permit indirect connection between i and j, thus 
in this case r»j > r\k + rkj- We defer the treatment of obstacles to a future paper. 
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For a given configuration of node positions G V, assuming that the nodes are pair- 
wise connected with independent probabilities Hij, the first term in equation ^ is the 
probability of being fully connected Pf c (r\, . . . rjv)- The average of this quantity over all 
possible configurations Pf c = (P/ C (ri, . . . rjv)) is the overall probability of obtaining a fully 
connected network and is our desired quantity of interest. Hence, rearranging equation ^ 
allows us to obtain expressions for Pf c in a consistent way while keeping track of correction 
terms. 

2.1 Zeroth order approximation 

In the very high density limit of p — > oo, the right hand side of equation ^ is dominated 
by the first term 



and hence the network is fully connected with probability one. The approximation symbol 
is used here and from now on to indicate that higher order terms are being ignored. 

2.2 First order approximation 

The first order approximation is obtained when the second term in equation ([5]) is expanded 
out explicitly. This takes into account all the ways of having an N — 1 cluster of nodes. 
Thus the overall probability of a fully connected network is 




(6) 
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(7) 



l-N(U(l-H jN )) 



3=1 




N-l 
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since all nodes are identical and therefore the sum over p can be factored out. 

Assuming that the network is homogeneous (i.e. there are no boundary effects and 
therefore the system is symmetric under translational transformations) allows for the change 
in variables r = ri — r^v so the integrals decouple and we get 

N-l 



P /c «l-tf fl-ijf #(r)dr) 



1+ N\ P .l v H{r)dr 2 ) + °{n-i 



for N large. 

At this point we are in a position to discuss the various scaling limits and approximations 
appearing in the theory. Until now we have assumed only that the probability of full 
connectivity is high, which is reasonable for typical applications such as wireless networks. 
This means that pVn is large where Vjj = JyH(r)dr is the effective connectivity volume 
associated with H(r). Increasing N and V at constant p will however decrease Pt c due 
to the factor of N in front of the exponential. We see that at fixed Pf c , system size 
scales exponentially with density. While this is a popular scaling in the literature, it has 
the effect of hiding boundary effects, which we will see are very important. Alternatively 
we can increase p while keeping V fixed. For this purpose, and for any scaling in which 
V/p decreases, we must use the first line in equation Q as the second line clearly does not 
converge. However it is also advantageous to assume V/Vjj is large, for example to separate 
effects due to the range of direct connections which is proportional to vj/ d , and effects due 
to the finite extent of the whole system. In this case, so for any scaling where V is between 
linear and exponential in p, we can use the simpler second line of equation ([8]). We will 
take the latter approach unless otherwise stated. 

2.3 Second order approximation 

The second order approximation involves enumerating the N — 2 cluster terms, together 
with the first approximation of the term in equation ([7]) such that 



Pfc*l-{ E Hg)-{ £ H g ). 



(9) 

N-l 
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We first examine the first approximation of the Gfj_ l term thus including all possible ways 
of getting an N — 2 cluster in a N — 1 node network: 



and 



( E w » 
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(End 
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< E ««> = ^EII( 1 -^) E 



*=1 



(10) 



(11) 



equation (10) out we have 



In this way we avoid double counting of different decompositions of subgraphs. Expanding 

(7V-2)terms \ 

^ , ~ v /- 

cna-^E] 



N 



< E ^) = (EIK 1 -^>) 



i 



P =1 



( (N— l)terms 

' N , , , s 

eii^-^e n^-^) 



N(H(1 - H l3 ) - N{N 2 l) I (1 - H 12 ) H Xj ){l - H 2j ) 

3=2 ' \ i=3 



(12) 



since all nodes are identical. 

We now consider the last term in ^ . There are two possible ways of getting an N — 2 
cluster. These correspond to having two isolated nodes and one large N — 2 cluster, or 
having a small 2-node cluster and a large N — 2 cluster. Hence, adding these two (disjoint) 
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possibilities together we have that 



37% 



E «p 



N 

= ( 2 ~ 1} ((1 " #«) 11(1 " - ff-y)) 

3=3 




(13) 



3=3 



7V(iV - 1) 



A? 



(IJ(l--ffli)(l-^))> 



3=3 



due to the identity of equation Q. 

Putting it together we find a clear sum of zeroth, first and second order terms: 

P fc « 1 - iV(Q(l - H W )> - 1 V 12 !!(! - ff y)(! " (14) 

3=2 3=3 

This is our most general result, allowing any dimension, region, and suitably decaying 
function H(r). Assuming that the network is homogeneous as before gives 



7c 



1 - N 1 



1 

V 



H(r)dr 



N-l 



N(N — 1) 
2V 



H(r) 1 



K(r) 
V 



N-2 



dr (15) 



where 



Kin) = V — f(l- F(r 2 ))(l - F(r 12 ))dr 2 
Jv 



(16) 



is a quantity that remains bounded as V increases. For sufficiently large V (see Q above) 
we can approximate this as 



P N 



H(r)e- pK ^dr. 



(17) 
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2.4 The H(r) function and connectivity on a sphere 

Translational invariance is unlikely to be a good assumption in practice, except if the nodes 
are situated on the surface of some n-sphere S n C with n > 1 such that the system 

is symmetric under rotations about the origin. Here, we consider the realistic case of n = 2 
for a specific connectivity function related to wireless communication networks, which is 
exponentially decaying with distance. 

The information outage probability P ou t for a single-input single-output (SISO) Rayleigh 
fading link modeQ is given by {36] 

P out = Pr (log 2 (1 + SNR x \h\ 2 ) < x) 

( V-W ( 18 ) 

= Pr(|/ l | 2 < ' X 



SNR 

where h is the channel transfer coefficient, x is the minimum outage rate threshold, and the 
signal-to-noise ratio (SNR) oc r _T? where r is the dimensionless distance between connected 
nodes (relative to the signal wavelength), and n is an environment dependent decay param- 
eteij^] The random variable \h\ 2 is typically drawn from a standard exponential distribution. 
Therefore the connectivity probability function can be written as 

H(r) = l-P out = e- fSr \ (19) 

with f3 typically a small dimensionless constant, and is plotted in Figure [3] for f5 = 0.01 and 

different values of n. Notice that as n — > oo the connectivity is no longer probabilistic and 

converges to the interesting case of the popular unit disk model where connections have a 
_ i 

fixed range of ro = (3 i . 

The surface element on a sphere of radius R is R 2 sin 9d9d(p in the usual spherical 
coordinates. First we use the Euclidean distance in M 3 between two nodes, given in terms 
of the angle they subtend at the center: d(r\, r 2 ) = |ri2 1 = V2R\/l — cos 612 ■ Setting 77 = 2 
(corresponding to propagation in free space) and locating one node at the pole 9 = we 



2 Other link models can also be considered. These may include single-input multiple-output (SIMO), 

multiple- input single-output (MISO), and multiple-input multiple-output (MIMO) (see also [37]). 

3 Typically 77 = 2 corresponds to propagation in free space but for practical reasons it is often modeled 

as 77 > 2 for cluttered environments. 



11 




Figure 3: The probability of connecting a distance r from a node is plotted for r\ = 2, 3, 4, 5, 6, form right 
to left, using j3 = 0.01. 



therefore have that 



ff(r)dr 



2-7T /»7r 



JO 
1 - e 



e -2/3-R 2 (l-cos0)#2 gin ^ d 



(20) 



when j3R 2 (proportional to V/Vh above) is large. 

If instead geodesic (great-circle) distance is considered rather than Euclidean we have 
<i(ri,r2) = R0 and hence 



s-' 



F(r)dr 
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7V p 


Jo 


Jo 




1 - 


P 



e -(3(R0) 2 R 2 sin0d0d ^ 
/3tt 2 R 2 ^ 



1 



1 e 

+ 



(3R 2 \ 6 2tt 2 



7T 



(21) 



again when (3R 2 is large. In this limit we see that the curvature (independent) effects 
become small for both distance functions. It is thus possible to approximate the neighbor- 
hood of each node by a flat Euclidean metric, more generally for other smooth manifolds 
A4 d , assuming a sufficiently fast decay of H(r). 

Since the dominant contribution of J v iT(ri )iT(ri2)dri is when nodes 1 and 2 are close to 
each other (i.e. when <Z(ri, r2) <C R), we adopt the above approximation in order calculate 
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iT(r 2 ) using polar coordinates 

K(r 2 )= [ HM + Hirrd-HirjHiruddr! 
Js 2 

[ i/(ri) J H'(ri 2 )dr 1 
P Jr 2 



/*00 p27T 

2-- / rie -/3(2rf+r 2 2 -2r 1 r 2 cose) dMri ^ 

P Jo Jo 

roc 

2~- / 2^r 1 e- /3 ( 2r ? +r ^/ (2/3r 1 r 2 )dr 1 
P Jo 



7T 7T _ d 

2— -e p 2 



/3 2/3 

where Io(:c) is the modified Bessel function of the first kind, r\ and r 2 are the distances of 
the two nodes from the origin and 9 the angle between them. Finally, we may evaluate 

f H{T)e- pK{r) dr^ [ H{T)e- pK(r) dr 

Js 2 Jr 2 {0 o\ 

3.p / 2 , { ' 

= e _ + o09" 2 



by expanding the exponential in equation (22) around r 2 = as this is near the main 



contribution to the integral. Altogether, according to equation (17) this gives 

P fc ^l-Ne~ p e e"^, (24) 

to leading order. 

Note that the above calculation can of course be performed for general rj > and n- 
sphere. The general case is excluded for the sake of brevity, however one case that provides 
some insight is the "unit-disk" model, corresponding to the limit rj — > oo with (3 scaled so 
that fi~ x l rt approaches a limit, tq. We then have 

{1 r < rn 
(25) 
r > ro 

leading to 

/ F(r)dr « / F(r)dr = 2vrr 2 (26) 
Js 2 Jr 2 

for either metric (actually it is exact for the Euclidean metric). We then calculate on M. 2 
for r 2 < 2ro 



K(r 2 ) = 2itr Q + -z-\/4rg — r\ - 2r arccos 

(27) 

= nr 2 + 2r r 2 - + ... 

12r 
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expanding for small r 2 as this leads to the main contribution to P/ c . The latter comes to 



fc 



1 - Ne~ m ° p 



7T 

4r> 



(28) 



where the last term is the second order contribution. 



We now make an important observation. From equation (24) we notice that in the high 
density limit the second order term in our expansion of Pj c has exponent of order ~ — ^¥ 
and hence is exponentially smaller than the first order term. This is due to the probabilistic 
nature of the connections, two nodes need more excluded volume on average to be isolated 
than a single node. For the unit disk model, however, the two nodes can be very close with 
a probability that is an algebraic function of their mutual distance, requiring roughly the 
same excluded volume, and leading to a second order effect which differs from the first only 
by an algebraic function. 

This means that higher order effects appear to be important mostly for the unit disk 
model. For the probabilistic model which is more realistic for applications such as wireless 
networks, we are justified to use only the first order results for the discussion of boundary 
effects in confined geometries; to this we now turn. 



3 Boundary effects 

In this section, we lift the homogeneity assumption and consider the first and second order 
approximations of Pf c only to discover that contrary to popular belief and practice, bound- 
ary effects matter. This is a central observation of the current work, which we detail and 
discuss in the following subsections. 

3.1 Inhomogeneous first and second order approximations 

Returning to the first order equation ([7]) but no longer assuming homogeneity gives 

jv i / i - 



p '^ l -vlA l -vK H{r ^) dr2 (29) 

= 1 - p [ e'Pfv H ^ d ^ (1 + OiN- 1 )) dr 2 , 
Jv 

for large N. This equation was recently given in [33] (equation (8)) with V scaled expo- 
nentially with p thus ignoring any boundary effects. Recall that for this approximation we 
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only require that V 3> p or equivalently V" > vJV. This is a key difference from previous 
percolation approaches. 



Similarly, from equations (12) and (13), one can show that the second order approxi- 
mation of Pf c becomes 



N-2 




l-pj e- p ^ H(ri2)dri dr 2 -^ J J ^(r 12 )e-^ (ri ' r2) d ri dr 2 , 



(30) 



where 



K(r 1: r 2 ) = V - f (1 - #(r 13 ))(l - # (r 23 ))dr 3 , (31) 
Jv 



and in the last step of ( 30 ) we have ignored terms of order ~ N 1 since N is also assumed 
to be large. 



Equation (29) suggests that in the high density limit, the probability of having a single 



N — 1 connected cluster is dominated by nodes which are situated in "hard to connect" 



regions of the available domain V. This is because the outer integral in (29) is dominated by 



contributions where the integral in the exponential is small, for example at corners, edges 



and faces. The second order corrections in (30), as expected from our calculations in section 



2.4, are of secondary importance and do not offer further insight. It is important to note that 
our approach here contradicts the usual universality scenario found in statistical mechanics 
and emphasizes the dominant importance of boundaries (and in particular corners). We 
stress here that this observation does not depend on using Euclidean distance and is valid 
in any geometry and any dimension where the lack of connectivity is due to a situation 
involving a single disconnected node and an N — 1 cluster.. In such a case, the outage 
probability will be dominated by situations where that node has a small volume in range. 



Returning to the general Rayleigh fading link model of equation (19) we now demon- 
strate the above observation in a straight forward way. Suppose that r 2 is situated some- 
where on the boundary of the network domain Vclf Since direct connectivity is expo- 



nentially decaying, we may separate the dri integral and extract its leading order behavior 

/ ff(r 12 )dn « ( r^e-^dr) ( f da) = ^L, (32) 
Jv \Jo J \J J rjf3v 
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where Q = is the full solid angle in d dimensions and u G (0, f2) is the solid angle 



available from r 2 . In the case of rj = 2 the gamma functions cancel and equation (32) 

d 

simplifies to 2 ^. As disused before, scaling /3 = r^ 1 we recover the popular unit-disk 
model in the limit of rj — > oo, allowing for direct comparison with the probabilistic case of 
(19) and other earlier results. For example, if rj = d then equation (32) gives r^/d which is 



also what the unit-disk model gives. 



In general we expect that the outer integral in ( 29 ) will have contributions from bound 



ary regions as in (32), with a term of the order of exp(— pu f H(r)r d ~ 1 dr) for the smallest 
u, for example the pointiest corner, dominating Pf c at high density. It is clear from this 
argument that connectivity is indeed dominated from regions which are hard to connect 
and is controlled by the solid angle available to them. 

The above observation has brought forward a radically different understanding of con- 
nectivity in confined geometries, namely that full connectivity is dominated by the critical, 
hard to connect areas such as corners, edges and faces. Furthermore, this suggests the de- 
composition of the probability of a fully connected network Pf c into a sum of contributions 
due to boundary objects with different solid angles. In order to obtain a more in-depth 
understanding of this novel idea we now embark into a more detailed investigation by con- 
sidering a few example domains. Through these examples, the boundary effects will become 



clear thus leading to the main result of this section; a general formula (see equation (46)) 
for P fc . 

3.2 Example 1. Circle (r/ = 2) 

We start with a circular domain cr C M? of radius R. Using a Euclidean metric, the 
distance between two nodes is given by d(ri,r2) = | ri2 1 = \/l r il 2 + | r~2 1 2 — 2 1 ri 1 1 r*2 1 cos6>. 
Hence, for the case of rj = 2 we must first evaluate 

/ i?(ri 2 )dri = [ R (V ie -/ 3 (r?+'i-2nr 2 cos0) x \ d9dn 
Jc R Jo Jo v J 

(r 1 / (2r 1 r 2 /3)e- /3 ( r i +r 2)) dn, 



2vr 







where Io(x) is the modified Bessel function of the first kind. Unlike the ordinary Bessel 
function, which is oscillatory for real arguments, Io(x) is exponentially growing. When r 2 
is close to the center (i.e. r 2 ~ 0) we have that e -/3r2 io(2rir2/3) = 1 + so that the 
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integral becomes: 



2tt 



R 



(r 1 /o(2r 1 r 2 /3) e -^ r ? + ^)) dn = - ( 



1 - e 



0(r : 



7T 



(34) 



where we have assumed that (3R 2 3> 1. Notice that this is the same assumption made 



in equations (20) and (21) and will be consistently used throughout our calculations. 
Otherwise, if r 2 is not near the center of cr, we have the asymptotic relation Io(x) = 
^— (1 + 0(x~ 1 )) so that the integral becomes: 



2tt 



R 



(r 1 / (2r 1 r 2 /3)e- /3 ^ +r 2)) d n = 2vr J fr x - |== - e'^+^A d n 



7T 



dri 



(35) 



7T 

2£ 



erf 



+ erf 



/3r 2 



since the main contribution of this integral comes from r\ ~ r 2 . Matching the two solutions 



for small and large r 2 we obtain a nice approximation to equation (|33|) 

V^(fl-r 2 )l +1 



7T 



F(n 2 )dri rj (erf 

= ^/(r 2 ) 



In order to make progress, we approximate /(r) by 
fir) - 



(36) 



ci, for < r < a, 

c 2 — m(r — R), for a < r < R, 



(37) 



^t=2 when 



where c x = 2erf [y^f ] ~ 2, c 2 = erf [y/pR] « 1, m = (l - e - ^* 
V^5i? 3> 1 and o is their intersection approximating the location of the effective turning 
point of f(r). 



The outer integral in equation ( 29 ) can now be approximated to give 

~Pfc a #(ri2)dri 



dr 2 



CR 



R 



l-2irp / re p ^ f[r) dr 



1 - nRye 1 



7T 



^fPR p^R 



+ 0(R 



-2- 



2ttR\ -e ^ p I 1 

V 7T 



/3 



py/wR 



0(R- 
(38) 
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Figure 4: Left: (Color online) A plot of Pf c (p) in the circular domain c R of area A = 200 (/? = 1) (solid 
red) compared with the prediction of equation Q (dashed) and a numerical simulation (black). Right: The 



dashed curves show the contributions from the bulk (blue) and boundary (purple) term in equation (381 on 
a log-linear scale. The full curves are plots of P out (p) with dark red and black corresponding to theory and 
numerical simulation. 



It is clear from equation ( 38 ) that Pf c is composed of a bulk term and a boundary term 
with coefficients characteristic of the area and perimeter of cr, with corrections due to the 
curvature of the boundary which vanish as R — >■ oo. Moreover, Pf c is dominated by the 
boundary term in the high density limit. This can be seen on the right panel of Figure |4j 
and that this transition occurs at pt ~ ^ In ^^R as R ^ oo. 

The left panel of Figure [4] shows an encouraging comparison with a numerical Monte 
Carlo simulation. Nodes are randomly placed within a circular disc of area A = 200 and are 
connected with probability H{rij) (/3 = 1). The error at lower densities is of course expected 
as the theory developed above is only a first order approximation to Pf c . Nevertheless, the 
agreement with simulations substantially improves at high densities but is also good in 



mid-densities. Hence, equation (29) provides a useful path which one may take to predict, 



control, or even optimize and set benchmarks for achieving full network connectivity. 
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3.3 Example 2. Square (rj = 2) 



We now consider a square domain sl of side L. The integral in the exponent of (29) can 
be evaluated exactly in Cartesian coordinates for 7] = 2 to give 



ff(ri 2 )dri 



-/K(*i-*>) 2 +(Wi-tt) a )) d Xl d Vl 



(39) 



7T 



where h(x) = (erf \f]i 



> (L-2x) 



4/3 
erf 



h(x 2 )h(y 2 



(L+2x) 
2 



Due to symmetry, we need only consider 



(L-2x) 
2 



+ 1 



the positive quadrant of sl i.e. when X2,y2 £ [0, j]- Hence h(x) [erf 
and may be further approximated by f(x) with R replaced by L/2 (see equation (37)) such 
that 



(40) 



Note however that it is crucial that the quadratic x 2 y2 term is not included in the expansion 
of h{x2)h(y2) as it is of higher order. Performing the integrals and multiplying by four we 
obtain a result 



P fc = 1 - L 2 pe~^ p (1 - OiL- 1 )) - AL^e~^ p (l - O^ 1 )) - ^e"^ 



(41) 



assuming that 3> 1. Note that the above calculation can naturally be extended in a 

straight forward way to include for general r\ > and higher dimensional d > orthotopes 
(hyper-rectangles) of non-equal sides. 



As expected, Pf c in equation (41 ) is composed of a bulk term, a boundary and a corner 
term with coefficients characteristic of the corresponding volume of the relevant object (e.g. 
area, perimeter, number of corners) and is dominated by the corner term in the high density 
limit. This is illustrated on the right panel of Figure [5] One would expect however the 
coefficient of the corner term to be 4 rather than 16. To understand why this is not so, we 
will consider a general angle term in the next subsection. Finally, we notice that there exists 
a parameter window < p < pt 2 where the edges of the square provide the dominant 
contribution to Pf c . The left panel of Figure [5] shows another encouraging comparison with 
numerical simulations. 
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Figure 5: (Color online) Left: A plot of Pf c (p) in the square domain sl of area A = 200 (/? = 1) (solid 
red) compared with the prediction of equation Q (dashed) and a numerical simulation (black). Right: The 
dashed curves show the contributions from the bulk (blue), the boundary (purple) and the corner (yellow) 



term in equation (41 1 on a log-linear scale. The full curves are plots of P ou t(p) with dark red and black 
corresponding to theory and numerical simulation. 



3.4 Example 3. Circle sector: General angle (77 > 0) 

In order to understand corner effects better, we now consider for general i)>0a circular 
sector C M 2 of central angle $ E (0, n) and circle radius R. We ignore the bulk 
and boundary contributions to Pf c , and concentrate on the corner term which we denote 
Cd(i?) for d = 2. Note that the distance between two nodes is now given by d(ri,r2) = 
\/|ri| 2 + I r 2 1 2 — 2|ri||r2| cos(#i — 62) with angular positions G (0, •&). Following the 
arguments of section 3.1, we expect that to leading order C 2 ($) is when r2 is situated near 
the corner i.e. when r 2 ~ 0. Hence we expand H(ri 2 ) about the corner in polar coordinates 
such that to leading order 

r-R 

F(n 2 )dn = lim / / ne - ^ ( 1 + T ] ^r[~ 1 r 2 cos(#i - 9 2 ) + 0(r$)) dndfli 
c R # R->°°Jq Jo v ' 

(42) 



and hence 



tfr(jj+r2r(ij(sin#2-sin(fl2-tf)) 2 



R -pi {v) 2 {v) 2 2 — — -+0<V|) 

C 2 (i?) = lim / / r 2 e V vP* J dr 2 d6 2 



R— >oo 



JO 



r 

2 "P 



p 2 sint?r (1 + i] 



-n 2 ' 
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Reassuringly in the case of a square with ij = 2we have that 4C2 ( 



!6/3 "JSP 



p^TV 



e 4 P as in 



equation (41). Furthermore, from (43) we identify three main regimes as follows 



for < 1, 



(1?- f) + f) 2 ), for 7? « f , 



\ TTp [TV — 77) TTp V <" ' 

for the case of 77 = 2 and are illustrated in Figure [6j Finally, £2(1?) has a minimum at 

2/3 



(44) 



it — arctan 



(45) 



Physically, the picture shown in Figure [6] makes sense in the following way: as t? — > n, 
the corner turns into a wall of infinite length and thus C2(0) diverges. This argument is 
supported by the fact that in this limit, the exponent is ~ — and hence of the same order 



as that of the perimeter term in equation (41 ) 



3.5 A General formula 

In high-dimensional domains Vcf* (d > 3), boundary effects are due to intersections of 
hyperplanes which the above examples and techniques can generalize to account for. As a 
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Figure 7: Left: (Color online) Comparison of Pf c (p) from theoretical predictions of (461 (dashed curves) 



with direct numerical simulation (jagged line) of two random graphs inside a cube of side L = 7 for the unit- 
disk connectivity function with ro = 1 (blue) and the probabilistic Rayleigh fading link model with r\ — 2 
(red). Right: The corresponding P ou t(p) is plotted on a log- linear scale emphasizing the good agreement 
between theory and simulations as well as the difference between the two models. 



result Pf c can be expressed as a sum of contributions due to objects of different codimension 
i = 0, 1, . . . d (with i = corresponding to the bulk/ volume term) 

d 

P fc^ 1 " £ £ /'' ' G , V ,/ (46) 

i=0 ji 

where d £ N is the space dimension, Gj i is a geometrical factor for each object j of codimen- 
sion i while is the corresponding d — i dimensional volume of the object with solid angle 
Uj t £ (0, f2) and % = Jq°° r d ~ l H(r)dr. It is clear that for i = and 1, the second sum in (46) 
contains only one term. The remaining one dimensional integral is easily computed for typ- 
ical connectivity functions Hij (also see |37J). Corrections due to curved hyper-surfaces 



are expected to give algebraic corrections as in (38). Note however that is Hij depen- 
dent while ~Vj i is not. In the case of H(r) = e~^ T ' 2 we find for example for d-dimensional 

i(d-l) 

hyper-rectangles where all hyperplanes meet at right angles Gj i = 2*^~ 1 ^ \J^J 2 while 

for the unit-disk model with range ro we find Gj i = 2 l(l ~ VK?— u wnere = r(n/2+i) 
is the volume of an n-dimensional sphere of radius ro. These two cases are compared in 
Figure [7] for a cube of side L = 7. 

The above formula is a high density expansion giving the probability of achieving a 
fully connected network of nodes in confined geometries. For all practical purposes (d < 



3) equation (46) can account for most interesting (from an engineering point of view) 
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Figure 8: (Color online) Left: Comparison of Pf c (p) from theoretical predictions of (46 1 (solid curves 



with direct numerical simulation (black) of two random graphs inside circular sectors Cr 1! -& 1 (blue) and 
cr 2i # 2 (red) using /3 = 1 and r\ — 2. The sectors are of equal perimeters Vij = 70 but with $i = n/4 
and $2 = 37r/4. The dashed curves correspond to conventional wisdom (i.e. the bulk contribution of 
equation Q). Right: The corresponding P ou t(p) is plotted on a log-linear scale emphasizing the good 
agreement between theory and simulations. 



domains in which a network may be deployed. Furthermore, its closed and simple format 
emphasizes the logical decomposition of the domain into objects of different full connectivity 
importance. This is illustrated in Figure [8] comparing two circular sectors (d = 2) of angles 
i?i = 7r/4 and i?2 = 37r/4 but of equal perimeters Vi 1 = 70. Notice that the density at 
which the numerical simulation results become distinct (p ~ 3) in the left panel of Figure [8] 
is also where the corner term of c/j 1j ^ 1 overtakes the common perimeter term (shown in 
black) in the right panel of Figure [8j 



4 Conclusion 

In this paper we have attempted to understand how boundary effects can influence full 
connectivity in confined geometries. Based on the fact that at very high densities the 
probability of full connectivity Pf c is simply the complement of the probability of a single 
isolated node, we have developed a cluster expansion approach which classifies terms with 
respect to their largest connected component. Assuming a homogeneous system and aver- 
aging over the available space of the domain, we extract the leading and next-to-leading 



order behavior of Pf c as is given in equations (|8j) and (17) respectively. Although our ap- 



proach can be consistently expanded to obtain even higher order corrections to Pfc-> using 
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an exponentially decaying connectivity test-function^] (equation (19)), we showed that cor- 
rection terms are exponentially small. This is because nodes are 'connected in probability' 
(in this case exponentially) rather than in an all-or-nothing way. 

The first main result of this paper follows from lifting the homogeneity assumption to 



obtain equations (29) and (30), which confirm our initial observation, namely, that in the 
high density limit, Pj c is dominated by nodes which are situated in "hard to connect" re- 



gions, characterized by their solid angle. Using (19) we then instructively worked through 
three example domains (circle, square and circular sector) in order to reach our second 



main result, a general formula for Pt c (equation (46)). The closed and simple format of 



(46 ) emphasizes the logical decomposition of the domain into objects of different full connec- 
tivity importance, according to their codimension. Finally, a comparison of our theoretical 
predictions with direct numerical simulations, in a variety of settings, demonstrated the 
inaccuracy of conventional percolation models while emphasizing the benefits of including 
boundary effects. 

In summary, full connectivity at high node densities is dominated by local geometric 
boundary effects such as corners, edges and faces. These contributions have universal prop- 
erties, distinct but complementary to those of previous percolation approaches, which are 
typically limited to bulk effects or use a scaling that removes boundary terms (see |34J). 
As a result, our findings bring forward a radically different understanding of connectivity 
in confined geometries while also providing useful formulas for the probability of full con- 
nectivity, crucial for the design of reliable wireless mesh networks |38j . Moreover, sufficient 
quantitative detail is available for analyzing and determining system parameters in order 
to mitigate boundary effects (see [37]). Conversely, boundary effects can be harnessed to 
avoid full connectivity. Such approaches can be useful in physically important models such 
as the spread of forest fires [18] , epidemics [21] , or mobile phone viruses [23] , given details 
of a specific model for connectivity. Finally, one may also consider the inverse problem of 
"connecting the shape of a drum" [39j to characterize unknown domains containing random 
networks. More generally, boundary effects are of major interest in the statistical mechan- 
ics of systems with very small structures such as in conducting carbon nanotubes [30] or 
very large and highly connected social and financial networks \26\ [27] where the geometry 
is likely to be dynamic. 



4 This was taken from a standard Rayleigh fading link model (also see [37] ) 
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